Blind Audiovisual Source Separation Using Sparse Redundant Representations
نویسندگان
چکیده
In this work, we present a method that jointly separates active audio and visual structures on a given mixture. This new concept, the Blind Audiovisual Source Separation (BAVSS), is achieved by exploiting the coherence existing between the recorded signal of a video camera and only one microphone. An efficient representation of audio and video sequences allows to build robust audiovisual relationships between temporally correlated structures of both modalities or, what turns to be the same, two parts of the same audiovisual event. First, video sources are localized and separated on the image sequence exploiting the temporal occurrence of audiovisual events and using a spatial clustering algorithm, without necessity of any previous assumption about the number of sources in the mixture. Second, the same audiovisual relationships together with a time-frequency probabilistic analysis allow the separation of the audio sources in the soundtrack, and, consequently, the complete Audiovisual Separation.
منابع مشابه
Blind Source Separation: the Sparsity Revolution
Over the last few years, the development of multi-channel sensors motivated interest in methods for the coherent processing of multivariate data. Some specific issues have already been addressed as testified by the wide literature on the so-called blind source separation (BSS) problem. In this context, as clearly emphasized by previous work, it is fundamental that the sources to be retrieved pr...
متن کاملBlind Separation of Sources in functional MRI sequences
Functional Magnetic Resonance Imaging (fMRI) is an important and popular tool for studying the human brain activity. In most fMRI scans, the BOLD technique is used, producing an image of the blood oxygenation level throughout the brain. High oxygenation levels represent high activity of brain regions responsible for performance of a specific task. The process can be modeled as a linear mixture ...
متن کاملOptimal Sparse Representations for Blind Deconvolution of Images
The relative Newton algorithm, previously proposed for quasi maximum likelihood blind source separation and blind deconvolution of one-dimensional signals is generalized for blind deconvolution of images. Smooth approximation of the absolute value is used in modelling the log probability density function, which is suitable for sparse sources. We propose a method of sparsification, which allows ...
متن کاملUsing Joint Sparsity for Blind Separation of Noisy Multichannel Signals
We call a set of vectors z[k] ∈ R jointly sparse, when for the most of them all m components are simultaneously [close to] zero. When recovering this set from [indirect] noisy observations using variational approach, joint sparsity prior can be expressed via convex penalty term ∑ k ‖z[k]‖2. In this work we explore joint sparsity in the context of blind source separation problem X = AS + ξ, wher...
متن کاملUnderdetermined Anechoic Blind Source Separation
In this paper, we address the problem of under-determined Blind Source Separation (BSS) of anechoic speech mixtures. We propose a demixing algorithm that exploits the sparsity of certain time-frequency expansions of speech signals. Our algorithm merges `-basis-pursuit with ideas based on the degenerate unmixing estimation technique (DUET) [1]. There are two main novel components to our approach...
متن کامل